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Abstract. I n order to generate visually and conceptually meaningful digital 
maps for small scales, it is necessary to select only the most relevant 
topological information to be displayed. On zoom levels covering 
continents, countries, or states there are usually far too many cities and a 
too dense street network to be shown on small or medium size displays. 
However, the decision of which cities and network I inks to show on the map 
cannot be done by an attribute- based selection of features. An adaptive 
method is required to thin out information in densely populated areas while 
at the same time showing enough information for sparse areas. In this 
paper we introduce our approach to place and street network selection for 
mobi I e maps as i mpl emented i n OpenSci enceM ap. 
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1. Introduction 

Digital topographic maps became ubiquitous on the I nternet and all kind of 
mobile devices. As the maps are viewed on a large variety of devices with 
fundamentally different display sizes, it is required to thin out and 
generalize the street network especially on low zoom levels covering 
continents, countries, states. This is required to reduce the amount of visual 
information and to enhance map legibility. Depending on the size of the 



display, we need to tailor the amount of information in the map to the 
visual interface: the smaller the display, the I ess information can be shown. 
As visual entities need space to be displayed and a certain size to be clearly 
visually discriminated by the user, the amount of information is also largely 
independent from the physical resolution of thedisplay. 




Figure 1; Street network only displaying highway: motorway,highway: trunk, and 
highway: primary. The network is too dense and cluttered for understanding 
significant connectivity and routes on the map, especially on small displays. There 
are too many i ndi vidual entities displayed, as a consequence most of them are too 
small and too thin to be recognized without effort. 



I n dense regions in the world, eg., parts of central Europe like the Ruhr- 
region in Germany, it is not possible to just apply a selection of entities 
based on a classification hierarchy. E.g., a naive heuristic is to display only 
settlements eg., "cities" or places with a population of more than 100000 
inhabitants on a given zoom level. Depending on the geographic area, the 
resulting set of places still can be far too dense, as all displayed places need 
to be labeled on the map and labels require space to be displayed. In 
contrast to that, in sparsely populated regions or regions with smaller 
places, too few representative places might be found and the areas can 
appear empty although they are populated. As a consequence, it is 
necessary to carefully select which places and corresponding labels are 



allowed to consume display space, and which places in turn get superseded 
by them. In addition to showing where places are, it is necessary to show 
the connectivity between them. A straightforward approach to select the 
street network is to only display streets of certain types. Similar to the 
amount of places, the resulting street network can be far too dense (see 
Figure 1). However, depending on the classification scheme and the actual 
street network, many of the displayed places will not be connected by a 
coherent network (see F i gure 2) . 

Our method addresses both goals: a meaningful distribution of cities of 
varying sizes for dense and sparse areas at the same time, as well as the 
selection of a coherent street network between them independent from the 
network cl assi f i cati on . 




Figure 2: Disconnected street network in OpenStreetM ap only displaying streets 
tagged with highway: motorway, highway:trunk. Not all places of e.g., Figure 4 
areconnected and the network is highly fragmented. 



2. Related Work 



The automatic selection of places and street network generalization is a 
major challenge in multi scale cartography. It always demands a rigorous 
reduction of the amount of places displayed on maps, which is typically 
done with clustering and/ or ranking algorithms (e.g., Yan & Weibel 
(2008)). These places need to be connected with a street network which 
should be chosen such that significant connectivity and structural 
properties are preserved. In Chaudhry & Mackaness (2006) the authors 
develop method to generalize street networks for lower zoom levels 
according to the 'rules of good continuation' introduced by Bruce et al. 
(2003). Their model selects salient streets with respect to human visual 
perception. Touya & Duchene (2011) propose a complex method 
incorporating different strategies for different thematic generalization 
rules. H owever, their framework targets medium to large scale maps, where 
the variety of rules are applicable. In a recent work, Samsonov & 
Krivosheina (2012) develop an integrated algorithm combining Voronoi 
diagrams with a multi -criteria weight for the selection of places and a 
accumulative global route selection for the generation of thestreet network. 

3. Our Approach 

Our goal is to develop an integrated method for selecting representative 
places and a corresponding street network for higher zoom levels. Our 
particular focus is on an even distribution of reference places on small 
scales, incorporating settlements of varying sizes with the target of making 
geographic areas rapidly recognizable. 

This work is implemented for our vector-tile project OpenScienceMap 1 
(Schmid et al. 2013). OpenScienceMap itself is built upon the voluntary 
collected OpenStreetMap 2 (OSM) dataset. All classification notions in this 
paper refer to the tagging scheme and guidelines of OSM. 



1 http://www.OpenScienceMap.org 



2 http://www.osm.org 



3.1 Identification of Relevant Places 

The identification of suitable places as spatial references on a map is a 
fundamental requirement for topographic maps. As each selected place 
potentially supersedes a neighbored place with comparable properties (eg., 
similar population, historical relevance, popularity, connectivity) we need a 
method to select a good candidate. I n many cases there is an obvious best 
choice, however, in many other cases there are good reasons for displaying 
each of them. An example is the Rhine-Ruhr region in Germany with an 
agglomeration of cities of similar population, importance for the region, 
and popularity. In all cases the reasons for favoring a pi ace over another is a 
mixture of algorithmically graspable parameters (eg., population, network 
connectivity, geographic location, etc.) and rather random contextual 
information (eg., tourism, infrastructure, culture, etc.). 

In OpenScienceMap we select the places according to their population and 
geographic location. We use a population- based clustering of places and a 
street network computation based on Voronoi tessellation between 
candidate places: with the population- based clustering we identify the best 
candidate for a given geographic area, and with the tessellation weensurea 
well-connected street network between these! ected places. 




Figure 3: Result of SELECT-PLACES with a minimal population of 10.000 as a 
lower threshold. Places represented by large dots are selected, brightness indicates 
the size of the population (dark = large, bright = small). 



Selection of Reference Places 

As we are interested in a geographically balanced distribution of places, it is 
not suitable to select places solely according to their classification or actual 
population. In OSM cities (tag:place=city) are defined as places with a 
population of 100.000 inhabitants and above. However, as the dataset is 
collected and annotated by volunteers, some of the tags and values are 
either not present or not correct (eg., some users tag settlements with 
10.000 inhabitants as cities). In this case we would not only obtain asetof 
places biased by agglomerations but also by wrongly annotated data in 
OSM. 

In order to identify the best candidate for a given area in the map, we 
implemented SELECT-PLACES (see Algorithm 1). SELECT-PLACES 
clusters places based on a lower bound of population, i ndependent from the 
type tagging (eg., tag:place=city vs. tag:place=town) for a given portion of 
space on the map. A city is selected if there is no larger city within a given 
distance. 

SELECT-PLACES works as follows: The input is given as a set of all places 
places represented by a tuple of location and population, and a minimal 
distance definition minDist. minDist defines a mapping of population 
ranges to distances: It is used to differentiate between less populated places 
that require a greater distance to other places to be selected compared to 
more populated ones. Asa result of this definition, in areas with two equally 
large cities the two cities can be closer to each other than two cities with 
large difference in population; in this case the larger city will supersede the 
smaller one. 

Figures 3 & 4 show the results of SELECT-PLACES with different 
parameterizations: Figure3 shows the result of a lower threshold of 10.000 
for input places, Figure 4 shows the result for 50.000 as the lower bound of 
the input. The result of SELECT-PLACES is a candidate set for the final 
map generation step. Although already rigorously reduced, the set might 
still be too large to be rendered along with the corresponding labels. The 
final selection of the places takes place in the applied labeling algorithm 
and itsparameters(compareFigure4withthefinal mapsin Figures7&8). 



Figure 4: Result of SELECT-PLACES with a minimal population of 50.000 as a 
lower threshold. Places represented by large dots are selected, brightness indicates 
the size of the population (dark = large, bright = small). 



Algorithm: SELECT-PLACES 

Input: Set of pi aces where pi ace is a tuple of (location, population) 

minimal-distance definitions minDist 
Output: Reduced set of places 



1: result «- places 

2: forall pi, p2 in asPairs(pLoces) : 

3: if (population (pi) < population(p2) : 

4: of = distance(location(pl) , location(p2) ) 

5: if getMinimalDistanceForPopulation(minDist J pi) < of: 

6: result «- result \ {pi} 

7: return result 



Algorithm 1 Identification of reference places. 



If a place pi has a smaller population than p2, and if the distance between 
them issmaller than the minimal distance for the population size as defined 
in minDist, we remove pi from the result set. This process is repeated for 
all pairs of places. This simple algorithm guarantees that all areas are 
represented by locally important places: places are selected if they are 
relatively larger than places in their surroundings. At the same time it 
guarantees an adaptive distribution of places with respect to population size 
and geographic area: it allows two or more equally large places to be 
displayed close to each other, while it ensures the selection of best reference 
places in sparsely populated regions. The minimal distance ranges in 
minDist need to be chosen according to the addressed map scale and 
display size: the smaller the display, the larger the defined distances needs 
to be; mutually preserved (visual) distance is the crucial parameter for the 
density of places on the map. 



3.2. Network Generation Between Selected Places 

To show the connectivity between the chosen places of algorithm SELECT- 
PLACES, we need to select a subset of the avai I able street network between 
them. The network should be chosen such that the most important ways of 
the covered geographic area are represented. Although there exists a 
hierarchy of different types of streets (eg. highway: motorway, 
highway:trunk, and highway: primary in OSM) it is not possible to just 
select a street or a series of streets between two places based on this 
classification. A street network is a complex mesh of different types of 
streets and in most cases it only is possible to connect two places with a 
combination of different street types. Reasons can be found in the physical 
street network itself, the geographic definition of a place in a dataset, and 
the user- based taggi ng i n the case of OSM . 

I n a complex network built upon different types of streets it is impossible to 
connect all places with streets of equal types. Aspects like topography, 
population density, or climatic conditions require adapting the network to 
the environment it is built in. In addition to the physical network, places 
have a geographic definition, usually located in their center. However, the 
centers of places are typically streets of local types (eg., 
highway: I iving_ street, highway: tertiary, or highway: secondary) which 
makes it necessary to connect heterogeneous types to form a coherent 



connection between two places. And finally the voluntary collection and 
tagging of data for OSM naturally introduces inconsistencies as the 
classification of streets is very often subject to individual perception and 
experience. 

Selection of Street Network Subset 

To compute a representative street network, we need to identify the 
structurally most important links between the places shown on the map. I n 
contrast to other approaches we do not compute all (shortest) routes 
between all pairs of places and select the repeatedly occurring ones. I n our 
approach we have a rather local view on the network: we only search for 
links between adjacent places as all identified links will sum up to an 
interconnected network. Neighborhood in a point cloud (like the set of 
places returned by SELECT-PLACES) is a difficult concept, as the local 
density of the places has strong influence on all parametric approaches 
(e.g., distance- based neighborhood) and will result in highly varying 
network densities: the more places are classified as neighbored, the denser 
will the resulting network be. A more conceptual view on neighborhood will 
guarantee an evenly represented network without overrepresentation in 
dense and underrepresentation in sparsely populated regions. 

Our concept of neighborhood is based on a Voronoi diagram of the selected 
places as identified in SELECT-PLACES. With Voronoi cells we can identify 
the adjacent places without any numeric parameters by browsing the 
adjacent cells of the corresponding diagram. Our algorithm COMPUTE- 
STREETNETWORK (see Algorithm 2) computes a street network based on 
the Voronoi diagram of the selected places by considering Voronoi 
neighborhood of 2 nd degree. This practice ensures the detection of large 
i nf rastructural links, which otherwise would not be reliably represented by 
just considering the direct neighborhood. An equal method is the 
application of a Delaunay triangulation of the point set of places. In this 
representation one can identify the same neighbors as in our Voronoi 
approach by navi gati ng al ong the mesh of tri angl es. 



Algorithm: COMPUTE-STREETNETWORK 

I nput: Set of places (places, voronoi-diagram, street- network) 

Output: Reduced street network between places 



1: network «- {} 

2: forall pi in pLaces: 

2: forall /?-,• in getNeighborCells(pi, voronoi-diagram): 

4: network *■ network U corrputeRoutejbi , n^, street -network) 

5: forall n k in getNeighborCells^n^^ voronoi-diagram): 

6: network «- network U corrput eRout ejbi , street-network) 

7: return network 



Algorithm 2: Identification of route candidates. 



COMPUTE-STREETNETWORK works as follows: it requires a set of places 
places (e.g., computed by SELECT- PLACES), the Voronoi diagram voronoi- 
diagram for them, and the corresponding source street network street- 
network. We now iterate through all places and identify their direct 
Voronoi neighbors. For each identified neighbor we compute a route and 
add it to network, the resulting reduced street network. As we want to 
identify the most important network links, we implement the route 
computation with a hierarchical router preferring higher classified street 
types (e.g. highway: motorway > highway:trunk > highway: primary > 
highway: secondary). This converges to a completely connected network 
with a focus on main infrastructural links. We repeat the steps for all 2 nd 
degree neighbors of the selected place to ensure the detection of indirect 
I i nks such as I arge motorways crossi ng a regi on wi thout bei ng part of a I ocal 
hierarchical shortest route (see Figure 5 for an illustration of 1 st and 2 nd 
degree neighbors). When all routes are computed COMPUTE- 
STREETNETWORK returns the subset of the input network (see Figure 6). 
To render the street network on the final map (see e.g., Figures 7 & 8), it is 
necessary to apply geometric line simplification algorithms. 



Figure 5: Voronoi-diagram for selected places, links between nearest neighbors 
are red, neighbors of 2 nd degree blue. The extended neighborhood ensures the 
detection of important network links which otherwise would be neglected by direct 
adjacency. 




Figure 6: Result of the street network computation for the Voronoi diagram in 
Figure 5. The street network is homogeneously distributed across the map and at 
the same ti me sparse. 



4. Conclusion 



Digital mu I ti scale maps require substantial generalization for small scales. 
This data reduction is necessary to make the displayed information 
understandable. One major challenge is to reduce the amount of displayed 
places and links of the street network between them. However, it is not 
possible to select places and streets only by their classification and fixed 
parameters, as the resulting information would be distributed unevenly 
across the map. In this paper we describe our approach for selecting 
reference places of varying sizes and a connecting street network for lower 
zoom levelssuch as conti nents, countries, and states. 




Figure 7: Street network and reference places in OpenScienceMap on zoom level 
4. The final labeling algorithm supersedes some of the places as computed in 
SELECT-PLACES. 



Our approach consists of an adaptive clustering method for identifying 
reference places for geographic partitions in the map, and a Voronoi 
diagram- based method for determining the street network. Our results 
show an even distribution of places and a similarly evenly distributed street 
network across highly heterogeneously populated regions (e.g., western vs. 



eastern Europe). Our work is implemented and available in the 
OpenSci enceM ap 3 dataset (see F i gures 7 & 8) . 




Figure 8: Street network and reference places in OpenScienceMap on zoom level 
7. The final labeling algorithm supersedes some of the places as computed in 
SELECT-PLACES. 



Acknowledgements 

We gratefully acknowledge support of the German Research Foundation 
(DFG) under grant number GRK 1498. 



References 

Bruce, V., Green, R. P. & Geogreson, A. (2003). Visual perception physiology, 
psychology, ecology. 4th edn, Psychology Press New York. 

Chaudhry, 0., & Mackaness, W. (2006). Rural and urban road network 
generalisation: deriving 1 250,000 from OS MasterMap. Proceedings ICC 
2005. 



3 http://www.OpenScienceMap.org 



Samsonov, T. E., & Krivosheina, A. M. (2012). J oint generalization of city points 
and road network for small-scale mapping. Proceedings Gl Science 2012. 

Schmid, F., J anetzek, H., Wladysiak, M., & Hu, B. (2013). OpenScienceMap: open 
and free vector maps for low bandwidth applications. In Proceedings of the 
3rd ACM Symposium on Computing for Development 2013. 

Touya, G., & Duchene, C. (2011). CollaGen: collaboration between automatic 
cartographic generalisation processes, Advances in Cartography and Gl Science. 

Yan H, & Weibel, R. (2008). An algorithm for point cluster generalization based on 
thevoronoi diagram. Computers &Geosciences 34(8): 939-954. 



